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Type theories with higher-order subtyping or singleton types are examples of systems where compu- 
tation rules for variables are affected by type information in the context. A complication for these 
systems is that bounds declared in the context do not interact well with the logical relation proof of 
completeness or termination. This paper proposes a natural modification to the type syntax for F®, 
adding a variable's bound to the variable type constructor, thereby separating the computational be- 
havior of the variable from the context. The algorithm for subtyping in F® can then be given on types 
without context or kind information. As a consequence, the metatheory follows the general approach 
for type systems without computational information in the context, including a simple logical relation 
definition without Kripke-style indexing by context. This new presentation of the system is shown to 
be equivalent to the traditional presentation without bounds on the variable type constructor. 



1 Introduction 

Logical relations are a powerful technique for proving metatheoretic properties of type theories. The 
traditional approach to the metatheory of type theories, for example that of Pure Type Systems 1U, 
studies properties of untyped reduction and conversion, and then completes the study of type-checking 
by proving strong normalization with a logical relation construction. 

This approach has been difficult to adapt to systems where variables may behave differently accord- 
ing to context information. Examples where this may occur are type systems with singleton types, where 
a variable of a singleton type is equal to the unique element of the type, or subtyping, where a type 
variable may be replaced by its bound in a derivation of subtyping. The key difficulty is that strong 
normalization of a term or termination of subtyping on higher-order types depends on information in the 
context, but that normalization or termination also needs to be closed under replacement by equal con- 
texts, in order to model the constructors that introduce the computational information into the context. 

For example, in F®, consider a putative proof of strong normalization in the case of a derivation of 
X < A : * — > * h X(C) : *. Such a proof would have a hypothesis that A (C) is strongly normalizing, since 
the model must allow a variable to be replaced by its bound, referred to as promotion. However, to model 
the rule that h VX < A : * -> *.X(C) = VX < B : * -> *.X(C) : *, we would need that B(C) is strongly 
normalizing for arbitrary B such that h A = B : * — > * before constructing the model; the behavior of 
X(C) varies according to its context. 

Several papers have addressed systems of this type, but each of these approaches differs from the 
usual approach to metatheory of type theories. Compagnoni and Goguen Q HI use an algorithm where 
a variable's bound is normalized before promoting the variable. This allows context replacement to be 
proved before the completeness proof, but it seems to be an odd requirement and was only introduced 
to get the proof to work. Furthermore, the algorithm is less efficient than an algorithm that postpones 
normalization of the bound. Stone and Harper |[T3ll prove termination for an algorithm for singletons 
using the unnormalized singleton rather than normalizing it first. Their Kripke-style proof indexes the 
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model with sets of possible contexts in which a term is well-typed. In the example above, the possible B 
are limited by considering contexts that arise from bounds introduced by the V constructor. This differs 
from the standard Kripke-style proof of strong normalization, which is relative to a single context. 

In this paper we propose separating the computational behavior of variables from the context. We 
introduce a modified type structure for F® [3j|4l[TT|], where the type constructor for variables is X& with 
the variable's bound A explicitly mentioned. We call this presentation "a la Church" for its obvious 
similarity to type labels on A -abstractions, and we call the traditional presentation "a la Curry". With 
this change, the term structure tells us how promotion will be used without reference to the context. 

This presentation allows us to give a kind- and context-free definition of the algorithm for subtyping, 
since the only use of the context in the traditional algorithm is when a variable is replaced with its 
bound. This in turn leads to an approach to the metatheory consistent with the usual approach for type 
theories, since promoting a variable to a type convertible with its bound, the cause of the difficulties in 
the system without bounded variables, is never necessary. In our example above, the terms would be 
VX < A : * — >■ *.X A (C) = VX < B : * *.X B {C): the behavior of X A (C) and X B (C) is fixed regardless of 
context. 

While changing the term structure could be considered a syntactic trick, in our opinion our presen- 
tation points to a deficiency in the syntax of the traditional term structure of F®. In general, model 
constructions work best when there is a close relationship between terms and derivations: this is best 
illustrated by Streicher's extended term structure and partial interpretation for the Calculus of Construc- 
tions [14]. The inability to construct traditional models to show decidability of higher-order subtyping 
for the expected algorithm suggests that the type structure of is inappropriate. We believe that the 
trick is that the syntax without bounded type variables works at all. The equivalence of the two pre- 
sentations shows that the additional information necessary for the model construction can be ignored in 
programs. 

The system Full F®, with contravariance in the bounds of quantified types, further illustrates our 
point. Surprisingly, in contrast to the system with unlabeled type variables, the algorithm for subtyping 
for Full F® with bounded variables cannot be defined. Essentially, when the implicit contravariant 
type information in the unlabeled types is made explicit, the side conditions distinguishing variable 
reflexivity from promotion cannot be expressed in a valid inductive definition. We shall discuss the 
technical reasons that the definition fails in more detail when we define the algorithm for Kernel F®. 
However, the inability to define the algorithm for Full F® over the explicit type structure gives a strong 
indication that the unlabeled type structure is an inadequate representation of types for F®, rather than 
the addition of bound information to the type being a trick. 

The correctness of the traditional system a la Curry is a consequence of the equivalence of the two 
presentations. There are two substantial differences in the treatment of the new system. First, because 
variables mention their bound explicitly, the new presentation exposes the difference between the opera- 
tions of renaming [X <— Y] , which changes variable names but does not change the bound, and substitu- 
tion of a variable, [Fg/X], which replaces the bound of X. Secondly, the subtyping judgement is needed 
in the formulation of the inference rules for the kinding judgement, for example in the rule TVar for 
kinding a type variable: this is not necessary in the traditional presentation. 

In this paper we address all of the complications listed above. We first study the properties of the 
subtyping relation, including completeness and correctness, anti- symmetry, transitivity elimination and 
decidability. We complete our development by showing that our system is equivalent to the traditional 
one without bounds. We ignore completely the term language, since its metatheory is standard once 
decidability of subtyping has been proved. As such, we do not treat substitution for bounded variables 
as occur in V, since this substitution only occurs in the reduction relation for terms. 
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2 Syntax 

We now present the term constructors, judgements and rules of inference for kinding and subtyping in 

2.1 Syntactic Categories 

The kinds of F® are the kind * of proper types and the kinds K — > K' of functions on types and type 
operators. We assume an infinite collection of type variable names X, Y, Z, . . .. The types include variables 
with explicit bounds X A ; the top type T*; function types A — > B; and types \/X < A : K.B of polymorphic 
functions, in which the bound type variable X ranges over all subtypes of the upper bound A. Moreover, 
like F m , we allow types to be abstracted on types, of the form AX : KA, and we can apply types to 
argument types A (B). Contexts T, A are either the empty context () or extended contexts F,X < A : K. 

We identify types that differ only in the names of bound variables. We write A(B\,...,B n ) for 
(A(B l ))...(B n ). If A = X c (B u ...,B n ) then A has head variable X c ; we write HV(-) for the partial 
function returning the head variable of a type. We also extend the top type T* to any kind K by defining 
inductively T K ^ K i = AX : K.T K i. We use X : K as an abbreviation for X < T^ : K in contexts; in this 
case we say X is a variable without a bound. 

Because type variables are decorated with their bounds, we need to be careful with our definition of 
substitution: specifically, a renaming should be restricted to renaming the variables in the bound A of 
a variable X A , as opposed to changing the bound as may occur in a substitution of Yg for X in X A . We 
therefore define parallel substitutions y, S as either the empty substitution (); the extension of a parallel 
substitution 7 with a renaming of a variable X by another variable Y, written y[X 4-Y];or the extension 
of a parallel substitution 7 with a substitution of a variable X by a type A, written y\A/X]. We say 7 is a 
renaming if 7 = () or if 7= yb[X <— Y] with 70 a renaming. We write idr for the identity renaming of the 
type variables declared in T. 

We write B[y] for the capture- avoiding simultaneous replacement of each of the variables by its 
corresponding value, defined as follows on variables and lifted in the usual way to arbitrary types: 

• X A [Q]=X A . 

• X A [y[X^Y]]=Y mx ^ Y]] . 

• X A [y[Y <- Z]} = X A[yM] , if X £ Y. 

• X A [y[B/X]]=B. 

. X A [y[B/Y]} =X A[y[B/Y]] , if X^Y. 

Observe that B cannot be a variable Y or Z in the last two equations, but must instead be a bounded type 
variable Yq or Zq. 

We also write B[A/X] for the parallel substitution that is the identity renaming on the free variables 
in B other than X, and the substitution of X by A. We have standard properties of parallel substitution, for 
example that A[y] [8] = A[yo 8} and (y[A/X])o 8 = (yo8)[A8/X]. We also write A \>B for the standard 
notion of one-step j3 -reduction. We have the standard property of Church-Rosser for reduction. 

2.2 Judgements and Rules of Inference 

The judgement forms are F h A : K for well-kinded types and T h A < B : K for subtyping. We sometimes 
write r h ok for T h T* : *, formalizing the well-formedness of T, and r h A = B : K for T h A < B : K 
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and F h B < A : K. We may also use the metavariable / to range over statements (right4iand sides of 
judgements) of any of these judgement forms. 

The rules of inference are presented as simultaneously defined inductive relations over the judge- 
ments. We start with several admissible structural rules, and follow with the kinding and subtyping 
rules. 



2.2.1 Kinding Rules 



The following rules formalize the judgement T h A : K, stating that the type A is well- formed and of kind 
K in context T. 

()I-T*:* 
F\-A:K X(f_ dom(r) 



T^B:K 



r,X<A:£hT*:* 
T ] rA<B:K X<A:KeT 



FhX B :K 

T,X :K\-A:K' 
r h AX : KA :K^?K' 
rhA:K^K' r^B:K 
T\-A(B):K' 

ThA:* rh£:* 
ThA^B:* 
F 1 X<A:K\-B:* 



T h VX < A : K.B : * 



(TopEmp) 
(TopExt) 



(TVar) 
(TABS) 
(TApp) 

(Arrow) 
(All) 



Notice that in rule TVar it is possible that X £ FV(B) when r h Xg : K. The premise that FhB.K 
in this rule ensures that Subject Reduction goes through smoothly, without needing to refer to subtyping. 



2.2.2 Subtyping Rules 

Finally, the following rules formalize the judgement T h A < B : K, stating that type A is a subtype of 
type B and both are well-formed of kind K in context T. 

F\-A:K 

(S-Refl) 
(S-Trans) 
(S-Top) 
(S-TVar) 
(S -Promote) 
(S-TAbs) 



FhA<A:K 

T\- A<B:K r^B<C:K 
ThA <C:K 

r^A:K 
T h A < T K : K 

F\-A<B:K F\-B = C:K X<A:KeT 
r\-X B <X c :K 

T\-A<B:K F\-B:K X<A:KeT 
T h X B < B : K 

r,X :KhA<B:K' 



T h AX : KA < AX : K.B : K -> 
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T^A<C:K^K' F\- B ~D : K 



ThA(B) <C{D) :K' 



(S-TApp) 



rhAi -^A 2 <Bi -+B 2 :* 



(S -Arrow) 



ThA = C:K T,X <A:K\~B<D:* 



(S-All) 



r h VX < A : K.B < VX < C : K.D : * 
T,X :K\-A:K' F\-B:K 



(S-BetaL) 



T h (AX : KA)(B) < A[B/X] : K' 
F,X :K\-A :K' F\-B:K 



(S-BetaR) 



FhA[B/X] < {AX:KA)(B):K' 



3 The Algorithm 



In this section we define the algorithm for kinding and subtyping. 

First, we define the relations — > w for weak-head reduction and -^>„ for reduction to normal form. 

Definition 3.1 • (AX : K.A)(B) -> w A[B/X\. 

• A(B) ^ w C(B)ifA^ w C. 

• ^> n TV 

• X A (Bi,...,B n ) -^ n X c (Di,...,D n ) if A -^ n C andB t -^ n Difor \<i<n. 

• AX : KA -^>„ AX : K.B if A B. 

• A->B^> n C->DifA^> n C and B -»„ D. 

• VX < A : K.B -»„ VX < C : K.D if A ^>„ C andB^ n D. 

• A C if A — > w B and B -^>„ C. 

We also write A \, n B iff there is a C such that A -»„ C and B -»„ C, and A |„ iff there is a C such that 

A ~*n C. 

Lemma 3.2 If A -^> n C and A\>B then B -»„ C. 

The algorithm has a judgement for kinding, r \- A A : K, and two judgements for subtyping, \- A A <w B 
for subtyping weak-head normal forms, and \- A A < B for subtyping arbitrary types. The judgement 
r \~a A : K corresponds to type inference: the context T and type A are inputs, and the kind K is an 
output. The algorithm for subtyping is analogous to untyped conversion in the A -calculus: it is purely 
a computational relation, without reference to kind information. Furthermore, the algorithm for kinding 
does not refer to subtyping, because subtyping is used for the term language of and not for types. 

The algorithm is defined by the following rules of inference. It is syntax-directed, and it will be shown 
to be terminating on well-formed types. Clearly types do not need to be well-formed to be subjects of 
the algorithmic subtyping judgement. Since algorithmic subtyping incorporates weak-head reduction, it 
is also clearly not terminating in general. 



T h A T* : * 

x<A-.Ker 



(AT-TOP) 



Th A X A :K 



(AT-TVar) 



r,X : K \- A A : K' X dom(r) 



(AT- TABS) 



T h A AX : KA : K -> K' 
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T^ A A:K^K' T^ A B:K 
T\- A A{B):K' 

r \- A A : * r \- A B : * 



Fh A Ai :K 
r,X <Ai :Kh A A 2 :* 



X <£ dom(r) 



r\- A VX <Ai : A".A 2 :* 
HV(A) undefined and A is not an abstraction 

\-aA< w T* 

X A (B u ...,B n )UXc{Di,-,Dn) 
h A X A (B u ...,B n ) < W X C (D U ...,D„) 

\- A A(B u ...,B n )<C Cy„X A (B u ...,B n ) 

l~A X A (Bl,...,.B„) <W C 

\- A A <B 



\- A AX : K.A < w AX : K.B 

h A fii<Ai h A A 2 <fi 2 
h A A! ^A 2 < w Bi -^B 2 

MUBi ^ A A 2 <B 2 
h A VX < A x : KA 2 < w \JX<B X : K.B 2 

h A C < w D 



B^ W D 



(AT-TApp) 



(AT-Arrow) 



(AT-All) 



(AWS-Top) 



(AWS-TVar) 



(AWS -Promote) 



(AWS-TAbs) 



(AWS -Arrow) 



(AWS-All) 



(AS-Inc) 



h A A <B 

The rule for subtyping of bounded variables, S-TVar, states that X A is less than Xb if A and B are 
equal. The algorithmic presentation of this system needs a side condition, independent of the algorithm, 
to determine whether to apply rule AWS-TVar or rule AWS -Promote: in our presentation of Kernel 
F®, this is whether the left- and right-hand sides are convertible. On the other hand, a translation of 
Full F® with bounded variables requires the premise r h B < A : K in AWS-TVar to demonstrate the 
equivalence of the explicit and unlabeled type presentations. However, the algorithm cannot be defined 
with this premise, because the negation of this premise in AWS -Promote requires a negative occurrence 
of the judgement being defined, which is not a valid inductive definition. 

Also, observe that while the comparison A J4 X A (B\, ...,B n ) in AWS-Promote might hold if either 
side diverges, in practice the algorithm will always be applied to well-formed terms, which will be shown 
to be terminating. 



4 Metatheory 

In this section we develop the basic metatheory for the algorithm. 

We begin with the relations A >p B, formalizing a use of promotion, and A >$ B, formalizing B an 
immediate subterm of A, both for A weak-head normal. 



Definition 4.1 

• X A (B u ...,B n ) > P A(B u ...,B n ). 

• - AX : K.A >5 A. 
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- A\ — )■ A 2 >sM and Ay —^A 2 >sM- 

- VX <Ai :K.A 2 > S A 2 . 

Definition 4.2 (Strong Normalization, Termination) We define the following predicates inductively: 

• SN(A) iffSN(B) for all B such that At>B. 

• T(A) iff T (B) for all B such that A >B, A > P BorA > S B. 

The predicate T(A), or A is terminating, formalizes the possible types that the algorithm may en- 
counter when invoked on a judgement containing A. As for strong normalization, a base case for T(A) 
would be a type with no reducts. 

Lemma 4.3 We have the following properties ofT(-) and SN(— ): 

1. T(A) implies SN(A). 

2. T (A) implies A \, n . 

3. lfT(A)andA>B then T (B). 

4. r(T*). 

5. T(A) iff T (AX : K.A). 

6. T(T K ). 

7. T(A) and T(B) iffT(A B). 

8. SN(A) and T(B) iffT(WX<A: K.B). 

9. IfA> P B then T{A) iffT(B). 

10. SN(A i )forl<i<niffT(X TK (A h ...,A n )). 

11. lfT(A), T(B), A(B) -+ w C and T(C) then T(A(B)). 

Proof The only case that is difficult is Case[TT] which follows by standard A -calculus properties. 

Proposition 4.4 (Decidability) lfT(A) and T(B) then \~a A < B and \~a A <w B terminate. 

Proof By induction on the sum of the length of the derivations of T(A) and T(B). 

There are two cases: either A or B has a weak-head reduct or they are both weak-head normal. In the 
first case the result follows by induction hypothesis. Otherwise, by inspection: 

• A = T*. If B = T* then \~a A < B succeeds, otherwise it fails. 

• A = Xc(D\, ...,D n ). If B = Xe(F\, ...,F n ) then C |„ E and D, |„ Fj terminate, since T(A) implies 
A |„ and T(B) implies B so if these conditions hold then \~a A < B succeeds. 

Otherwise, Xc(D[, ...,D n ) >p C(D[, ...,D n ), so \~a C(D\, ...,D n ) < B terminates by induction hy- 
pothesis, so \~a A < B succeeds or fails as \~a C(D\ ,...,D n ) < B does. 

• A=Ai -^A 2 . If B = T + then \~aA<B succeeds. \fB = B { B 2 then h A Bi <Ai andh A A 2 <B 2 
terminate by induction hypothesis, since A\ — > A 2 >$ A- for i G {1,2} and similarly for B, so 
\~a A\ — > A 2 < B\ — > B 2 terminates. 

Otherwise, \~a A < B fails. 
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• A = VX < A\ : K.A 2 . If B = T* then h A A < succeeds. If B = VX < B x : K.B 2 then A x U B { 
terminates because T{A\) and T{B\) imply A\ l n and Bi \. n . Furthermore, VX < A\ : K.A 2 >s A 2 
and VX < B\ : ^.#2 >s B 2 , so h A A2 < B 2 terminates by induction hypothesis, and h A VX < A\ : 
K.A 2 < VX < B\ : ^.#2 succeeds or fails as h A A 2 < B 2 does. 

Otherwise, h A A < B fails. 

• A = AX : KAi. If B = AX : K.B X then AX : K.A Q > s A and AX : K.B Q > s B , so h A A x < By 
terminates by induction hypothesis, and \~a VX : K.Aq < VX : K.Bq succeeds or fails as \~a A\ < B\ 
does. 

Otherwise, h A A < B fails. 
Lemma 4.5 (Reflexivity) IfT(A) then \- A A < A. 

Proof We show \~a A <w A for weak-head normal A and \~aA<A for all A, by induction on T(A). 

If A is weak-head normal then the proof proceeds by case analysis. For example, suppose A = 
X B (Ci,...,C n ). Then SN(fi) and SN(Q) for 1 < i < n, so B U B and Q i„ Q, so h A X B (C U ...,C n ) < w 
Xg(Ci, ...,C„). 

For arbitrary A, A B and h A B <iv B immediately if A is weak-head normal or otherwise by 
induction hypothesis. 

Lemma 4.6 (Subject Conversion) If\- A A < B, A |„ A', and B \. n B' then h A A' < B'. 

Proof By induction on derivations of h A A < B, using Church-Rosser for AWS -Promote. 

Lemma 4.7 (Normalization) If\~A A < B then there are A' and B' such that A -^> n A' and B B' . 

The following lemma simply states that promotion is always valid, even if the side condition of AWS- 
Promote is not satisfied. This is true because if the side condition is not satisfied then AWS-TVar can 
be applied. 

Lemma 4.8 (Promotion) If\- A B<C,A> P B and there is a D such that A D then h A A < C. 

Proof By Normalization there is a D such that C D. If C l„ A then h A A < C by AWS-TVar, and 
otherwise h A A < C by AWS -Promote. 

Lemma 4.9 (Transitivity) Ifh A A < B and h A B <C then h A A < C. 

Proof By induction on derivations, using Normalization and Subject Conversion in AWS-TVar and 
Promotion in AWS -Promote. 

The length of a derivation T(A) includes uses of reduction. To prove Anti-Symmetry, we need a 
measure that is invariant under reduction but respects >p. 

Definition 4.10 We define an alternative length measure \ T(A) \ of a derivation ofT{A) inductively as: 
max({ \T(B)\forB such that A > B}U {\T(C)\ + I for C such thatA> P C}) 

Observe that \T(A) \ does not depend on >j. 

Lemma 4.11 We have the following properties of the predicate T(A) and the measure \T(A) \ of deriva- 
tions ofT(-): 

1. IfT(A) and A -^ n Bthen \T(A)\ = \T(B)\. 

2. IfT(A), T(B) and A U B then \T(A)\ = \T(B)\. 
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3. IfT(A) and A>B then \T(A)\ = \T(B)\. 

4. |r(T*)| = a 

5. IfT(AX : K.A) then \T{AX : K.A)\ = 0. 

6. IfT(A 1 ->A 2 ) then \T(A X -> A 2 )\ = 0. 

7. IfTtyx < Ai : K.A 2 ) then \T(\/X < A\ : K.A 2 )\ = 0. 

Lemma 4.12 (Key Lemma) IfT(A), T(B) and h A A < B then \T{A)\ > \T(B)\. 



Proof By induction on h A A < B, using Lemma 4. 1 1 



Specifically, notice that the Key Lemma allows us to prove directly that \- A A{B\ ,...,B n )< X A (B\ ,...,B n ) 
is impossible, since \T(X A (Bi, ...,B n ))\ > \T(A(Bi,...,B n ))\ by definition. We use this fact in the proof 
of Anti-Symmetry. 

Lemma 4.13 (Anti-Symmetry) If\- A A < B, h A B < A, T(A) and T(B), then A |„ B. 

Proof By induction on derivations \- A A < B and \- A B < A. 
We consider two cases: 

• AWS -Promote is used in deriving \- A A < B. This is a base case with no use of the induction 
hypothesis. By the Key Lemma \T(A(B U ...,B„))\ > \T(C)\, and \T(C)\ > \T(X A (Bi,...,B n ))\, 
so \T{A(B u ...,B n ))\ > \T{X A (B u ...,B n ))\. However, \T{X A (B u ...,B n ))\ > \T(A(B u ...,B n ))\ by 
definition, which is a contradiction by trichotomy. 

• AWS-TAbs is used in deriving \- A A<B and \~ A B < A. Then A = AX : KA\ and B = AX : K.B { 
for some A\ and B\, with \- A A\ < B\ and \- A B\ <A\. By definition AX : KA\ >s A\ implies 
T(Ai) and similarly T(B\), so by induction hypothesis Ai |„ B\, and so AX : KA\ |„ AX : K.B\. 

5 Completeness of the Algorithm 

We now perform the logical relation proof to show completeness and decidability of the algorithm. 

Definition 5.1 (Semantic Object) A type A is a semantic object at kind K, written SOr{A), iffT{A) and 
^aA<T k . 

As for typical strong normalization proofs, this notion of semantic object does not require well- 
formedness. 

Definition 5.2 (Interpretation) The interpretations of a kind K, \=A£K and |= A < B 6 K, are defined 
by induction on K: 

• \=A£*iffSO+{A). 

• |= A < B g * iffSO*(A), SO±{B) and h A A<B. 

• |= A G K K' iffSO K ^K' (A) and \= A(B) £ K' for all B such that \= B G K. 

• \=A<BeK->K' iffSO K ^ K i (A), SO K ^ K , (B), \- A A <B and \=A(C) < B(C) G K' for all C such 
that \= C G K. 

The interpretation extends to parallel substitutions |= y G T as follows: 

• h € ()■ 
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• |= y[X ^Y]eY,X<A:Kijf\=yeT. 

• |= y[A/X] G r,X : K iff\= y G r and \= A G K. 

Observe that variables in the context of the form X < A : K where A ^ Tk can only take renamings 
[X <— Y]. These variables are never subject to substitution, since they cannot become the bound variable 
of an abstraction. 

Lemma 5.3 (Saturated Sets) The following properties hold for \= A G K and \= A < B G K: 

1. If\=A<BeKthen\=AeKand\=BeK. 

2. If\=AeKthenSO K (A). 

3. If\=A<BeKthenr- A A<B. 

4. lf\=AeK then |= A < A G K. 

5. \=J K eK. 

6. If\=AeK then \=A<T K eK. 

7. lf\=B^K and A >p B then \=A £ K, and similarly for the left- and right-hand sides of\=A< 

BeK. 

8. If\=B€K,T(A) and A — > w B then |= A G K, and similarly for the left-hand side of\=A <B£K. 

9. If |= A' < B' G K, A U A', B |„ B', T(A) and T(B) then \=A<BeK. 
10. If\=A<BeKand\=B<CeK then \=A<CeK. 



Proof By induction on K, using for example Reflexivity for Case[4j Cases[2]and[4]for Case[6j Lemma 4.3 
Case[9]and Promotion for Case [7} and Transitivity for Case[T0| 



Theorem 5.4 (Completeness) Suppose \= y G T. Then: 

• Ifr\-A:Kthen\-A[y] G K. 

• IfT'r A <B : K and\^ A[y\ < B[y] G K. 

Proof By induction on derivations. We consider several cases. 

• TopEmp. By Lemma |53] C ase [5] 

• T Var. If 7 = /o [X <— Y] then by induction hypothesis |= B [y] G K, and X B [ y ] > P B [y] , so |= X B [ 7 ] G K 
by Lemma |53] C ase |7| 

If 7= 7o[A/X] then |= A G Kby definition. 

• TAbs. By definition (= y[X <- Y] G F,X : K, so by induction hypothesis |= A[y[X Y]] G K', so 



T(A[y[X <r- Y}}) implies T((AX : K.A)[y] = AY : K.A[y[X <- Y}}) by Lemma 4.3 



Furthermore, if |= B G K then |= y[5/X] G T,X : ^ by definition. We have \=A[y[B/X}} G by 
induction hypothesis and T(A[y[B/X]]) by definition. Then T((AX : K.A)[y\) above and by 



Lemma|53j and (AX : K.A)[y](B) ^ w A[y[B/X}}, so T((AX : £.A) [/](£)) by Lemma|43|Case[TT 
Therefore, |= (AX : K.A)[y](B) G K' , and so |= (AX : K.A)[y\ £K->K>. 

V. By induction hypothesis \=A[y] G K, and |= 7[X <— F] GT,X < A : K implies |=S[7[X <— F]] e* 
by induction hypothesis, so r(A[y]) and r(fl[y[X <- F]]) by Lemma [53] Then T((VX < A : 
£.5) [7] = VX < A[y] : K.B[y[X <- F]]) by Lemma [43| so |= (VX < A : JT.fl)[y] G * by definition. 

S-Refl. By induction hypothesis |= A G K, so by Lemma 5.3 Case[4] |= A < A G ^T. 
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• S-Trans. By induction hypothesis and Lemma [53] C ase [T0[ 

• S-Top. By induction hypothesis and Lemma [53] Case [6] 

• S-TApp. By induction hypothesis \=A[y] < C[y] G K -> K' and |= B[y] < D[y] G K and |= D[y] < 



B[y] e A'. Then B[y] |„ D[y] by Lemma 5.3 Case [I] and Anti-Symmetry, and by definition 



(A(B))[y] = (A [/])(£[/]) < (C[y] )(£[/]) = (C(fl))[y] G We also know |= C[y] £ K —> K' by 
Lemma[531and \=D[y]eK, so [= (C(D))[y] = (C[y])(D[y]) etf' by definition and SO K , ((C(D))[yJ) 



byLemma |531Ca se[2| Therefore (C[y])(fl[y]) |„ (C[y])(D[y]) implies |= (A(B))[y] < (C(D))[y] G 



X"' by Lemma|5.3|Case[9[ 
Lemma 5.5 |= idr G T. 
Proof By induction on Y. 

Corollary 5.6 (Termination) IfYh A <B:K then T(A) and T(B), and h A A < B. 
Corollary 5.7 (Anti-Symmetry) IfY\- A<B ; K andY\- B <A:K then A |„ B. 
Corollary 5.8 IfYhT K <A:K then A l„ T K . 

6 Correctness 

So far we have not needed any properties of the judgements Y\- 7. We now develop some metatheory 
for those judgements and use the results to prove the correctness of the algorithm. 

Lemma 6.1 (Context) 1. IfY h J then FV(7) C dom(T). 
2. IfX < A : K G Y and Y h ok f/jerc X ^ FV(A). 
5. IfYhJ then T\- okas a sub-derivation. 
4. IfY,Y' ^ ok then Yh ok. 

Definition 6.2 (Renaming) y w a renaming for Y in A j/A h ok, y w a renaming, and y(X) <A[y]:KeY 
for each X <A: K eY. 

Lemma 6.3 (Renaming) IfY h 7 arcc? y jj a renaming for Y in A A h 7[y]. 

Lemma 6.4 #T, T' h ok, Y h A : # andX <£ dom(r, F) ?fcen Y,X <A : K,Y* h ok. 

Proposition 6.5 (Replacement) IfY,X < B : K,Y' \- J, Y h A < B : K and Y h A : K then Y,X < A : 

^,r'h/. 

Proposition 6.6 (Thinning) 7/T, T' h 7, T h A : A" W X £ dom(T, F) ^erc T,X < A : A, T h 7. 



Proof By Lemmas 6.1 and 6.4 T,X < A : A,r' h ok. Observe that idr is a renaming for T,r' in Y,X < 



A : K,Y'- Then Y,X <A:K,Y'\-J by Renaming. 

Proposition 6.7 (Substitution) IfY,X : K,Y' h J and Yh A: K then Y,P[A/X] h 7[A/X]. 

Lemma 6.8 (Subject Reduction) IfY \- A:K and A>B then Y \- A = B : K. 

Proof By induction on derivations. 
Add generation. 
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Proposition 6.9 (Correctness) The algorithm is correct for the declarative judgements: 

• IfT hokandFh A A:K then V h A : K. 

• IfFhA,B:K and h A A< w B then V \- A < B : K. 

• IfFhA,B:K andh A A<B then F\- A < B : K. 

Proof By induction on derivations, using Context and Renaming in AT-TVar; the generation prop- 
erty and Subject Reduction for AWS-TVar; the generation property for AWS-Top and AWS-Promote; 
Subject Reduction and Context Replacement in AWS-All, and Subject Reduction for AS -Inc. 

Corollary 6.10 (Decidability of Subtyping) Subtyping is decidable. 



Proof Suppose T h A,B : K. By Corollary ^6\T (A) and T(B), and so h A A < B is decidable by Propo- 
sition|4.4| and so by Correctness, r h A < B : K is also decidable. 



7 Relationship with Traditional F® 

We now show that the system with the bounded variable constructor is equivalent to the traditional 
presentation of the system, without bounds in the variable constructor. 

Explicitly, we take the syntax of traditional [121 , which differs from the syntax presented here 
only by having a type constructor X instead of the bounded type constructor X A . We write judgements in 
this system as T \~t J, with the decoration T for traditional. The rules of inference are also standard: we 
include two rules here but refer the reader to the standard references for the complete system^] 

F\- T A,B:K A= B B 



T\- T A<B 

r^ T A<B T\- T A(C):K 
rh r A(C) <B(C) 



(TS-Conv) 
(TS-TApp) 



Definition 7.1 (Decoration) Decoration of a type, A r , where V is a context in the system with bounded 
variables, is a partial function that maps types from the Curry-style presentation to the more explicit 
structure a la Church: 



• x l =X B ,ifX<B:Ke T. 

• (A — > B) r = A r — ^ B r . 

• (VX < A : K.Bf = VX < A r : K.B rx ^ A:K . 
r _ a v v aT,x.k 



(AX : K.A) T = AX : K.A 



• (A(B)) r =A r (B r ). 



• (T^) r = T,. 

The extension to contexts, P% is defined in the obvious way: 



• d = ()■ 



• (r,X < A : K) = r ,X < A :K. 

1 However, 1 5 1 has intersection types rather than an explicit rule for TV 
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Definition 7.2 (Erasure) The erasure map is simply the homomorphic extension of the stripping of the 
bound from the variable constructor: 

\X A \ = X 

The extension to contexts is also the homomorphic extension. 

Decoration and erasure have some simple properties, most important of which is that both preserve 
j6 -equality: 

Lemma 7.3 • If A =p B then \A\ =p \B\. 

• If A =p B and A r , B r defined then A r =n B r . 

Now, we can relate the Curry and Church presentations of F®. The proofs rely on standard properties 
of the traditional presentation, for example Church-Rosser for untyped reduction, Generation properties, 
well-formedness of contexts, well-kindedness of subtyping, and uniqueness of kinds. 

Lemma 7.4 (Soundness) • IfT \~t A : K then T d and A rd are defined and T d h A r> : K. 

• IfT h T A < B and Th T A,B:K then T d , A T " , B v ' are well-formed and T d h A T " < B^ : K. 
Proof We cons ider the rule TS-Conv. By Church-Rosser, there exists a C such that A >* C and B D>* C. 



By Lemma[L3] A Y " >* C Y " and B Y " >* C 1 "* . By the induction hypothesis, T d h A r " : K and T d h B T " : K. 
By Subject Reduction, T d h A 1 ^ = C 1 ^ : K and T d h B 1 ^ = C 1 ^ : K, and we have T d h A r ' <C V ' :K and 
p/ 1_ c r rf < B r" . K by definition. Finally, by S-Trans, T d h A r " < B r " : 

Lemma 7.5 (Completeness) • IfT \- A: K then \T\ \~t \A\ : K. 

• IfT\-A<B:Kthen \T\ h T |A| < |B| W |r| h r |A|,|B| : ^. 

Proof We consider the rule S-TApp. By the induction hypothesis, |T| h r |A| < |C|, |r| h r \A\,\C\ : # -)• 
A'', |r| \-j \B\ < \D\, and |T| \-j \B\, \D\ : K. By the kinding rule for type application in the traditional 
presentation and the definition of erasure, we have |T| \-j \AB\, \CB\, \CD\ : K', and by TS-TApp |T| \-j 
\AB\ < \CB\. By Anti-Symmetry B = p D, and \B\ = p \D\ by Lemma[73j so \CB\ =p \CD\. By TS-Conv 
jr| h r \CB\ < \CD\, and finally, by Transitivity, |rj h T \AB\ < \CD\. 

The important metatheoretic results for subtyping now transfer straightforwardly to the traditional 
one. 

Corollary 7.6 • IfT \~j A : K then A is strongly normalizing. 

• Typechecking T \-j A : K and subtyping T \-j A < B : K are decidable. 

• IfT hr A < B, T \~t B < A, T \~t A : K and T\~t B . K then A = p B. 



8 Related and Future Work 

An earlier version of this article was published in the unrefereed proceedings of Henk Barendregt's 
Festschrift [9J. The current paper extends the results of the earlier version by showing the equivalence 
with the traditional presentation of the system. 

In an earlier paper [7j, we considered an algorithm that reduces types to normal form before invoking 
the promotion rule in the algorithm. This makes context replacement trivial for equal types, since they 
have the same normal form and so altering the context does not alter the path of types considered by the 
algorithm. However, this algorithm is not optimal, since it normalizes the head earlier than necessary. 
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That earlier paper also used a typed operational semantics to show termination of the algorithm. This 
gave a more extensive treatment of the metatheory, and the admissibility of thinning, substitution and 
context replacement were consequences of the model. Furthermore, Subject Reduction was straightfor- 
ward in the typed operational semantics. In the current paper, finding the exact formulation necessary to 
show these results in the declarative system T h / turned out to be somewhat subtle, since the kinding 
judgement uses the subtyping judgement for the bounded variable rule. However, the approach using 
typed operational semantics was also longer and less approachable, and involved Kripke models for the 
proof of completeness. We hope that the current paper is clearer by not defining an intermediate system. 

In separate work (8), we also proved anti-symmetry of higher-order subtyping using the typed opera- 
tional semantics. The basic idea of that paper was to include the sub-derivation of replacing the variable 
in a bounded head variable expression Xa(B\, ...,B„) with its bound, A{B\,...,B n ). This idea is captured 
in the current paper by the T(-) predicate. The T(-) predicate is also similar to Compagnoni's approach 
with +-reduction 0, but we do not need to develop the metatheory of a new reduction relation. 

As mentioned in the introduction, Stone and Harper |[T3l use a logical relation defined over sets 
of contexts, instead of the standard logical relations over single contexts, to show termination of an 
algorithm for a type theory with singleton types, £ and IT types, and all of the tj rules. Their work does 
not normalize the singleton types. This is an elegant solution to the problem of varying contexts, but 
it raises the question of why singletons or F® should have different requirements on the Kripke-style 
relation than other type systems. 

Abel 0] has shown equivalence of a subtyping algorithm for higher kinds with polarity by direct 
induction on kinds rather than using a logical relation. 

There are several directions for future work. We would like to show that a Harper-Pfenning- style 
algorithm [10] is correct and complete for the type system. Furthermore, it would be nice to be able 
to prove context conversion and Church-Rosser in the model, as can be done for logical relations for 
equality, rather than proving them for the algorithm and lifting to the model. However, properties that 
follow straightforwardly for equality, such as that |= A = B G K implies |= A = A G K, cannot be shown 
so easily for subtyping. Finally, another candidate type construct that we might study with our technique 
of explicit type information is singleton types, which also have computational behavior expressed in the 
context. 

9 Conclusions 

We have introduced a natural and powerful extension of the syntax of F® and showed that the devel- 
opment of the metatheory is similar to the standard metatheory for type theories, specifically without a 
Kripke-style model and with a simple inductive definition capturing termination of the algorithm. We 
have shown all of the important results for the system, including anti- symmetry, transitivity elimination 
and decidability of subtyping. 
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